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METHODS FOR IMPROVING SENSITIVITY 
AND SPECIFICITY OF SCREENING ASSAYS 

This application is a continuation-in-part of U.S.S.N. 08/876,857, which is a 
continuation-in-part of U.S.S.N. 08/700,583 (U.S. Patent No. 5,670,325). 

FIELD OF THE INVENTION 

The invention relates to assays to detect nucleic acid markers of cancer with 
5 high specificity and high sensitivity for detection. 

BACKGROUND OF THE INVENTION 

Cancer is thought to arise from a multi-step process that typically involves 
multiple genetic mutations leading to uncontrolled cell growth. Many cancers are 
curable if detected early in their development. For example, colorectal cancers typically 

10 originate in the colonic epithelium, and are not extensively vascularized (and therefore 
not invasive) during early stages of development The transition to a highly- 
vascularized, invasive and ultimately metastatic cancer commonly takes ten years or 
longer. If the presence of cancer is detected prior to extensive vascularization, surgical 
removal typically is an effective cure. However, colorectal cancer is often detected only 

is upon manifestation of clinical symptoms, such as pain and bloody stool. Generally, 
such symptoms are present only when the disease is well established, and often after 
metastasis has occurred. Similarly, with the exception of the Pap smear for detection of 
pre-malignant cervical lesions, diagnostic screening methods for other types of cancer 
are best at detecting established disease. 

20 Most diagnostic assays for cancer are invasive, or at least uncomfortable. 

Invasive procedures range from performing a tissue biopsy to surgery. Cancer 
screening procedures frequently result in significant patient discomfort. For example, 
magnetic resonance imaging requires confinement of the patient, and colonoscopy 
requires sedation. The discomfort associated with typical invasive screening methods 

25 reduces patient compliance with routine screening procedures. 

Moreover, screening for the early detection of cancer (i.e., prior to the onset of 
symptoms and/or metastasis) often results in an intolerable level of false positive and 
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false negative results. The likelihood of a false positive test result is a function of the 
specificity of the test. The specificity of a test (expressed as a percentage) is the 
likelihood that any individual who is negative for disease tests negative for that disease. 
On the other hand, false negative results are a property of the sensitivity of the test. 

5 Sensitivity (also expressed as a percentage) provides the likelihood that a test for a 
specific disease will identify as positive an individual having that disease. Thus, 5% of 
the individuals determined to have a disease using an assay with a specificity of 95% 
will actually not have the disease. Similarly, an assay with a sensitivity of 95% will 
incorrectly identify a diseased individual as disease-free 5% of the time. 

10 The problems of sensitivity and specificity are exaggerated in assays for the 

early detection of cancer because patient samples on which such early detection is 
performed typically contain relatively small amounts of cancerous cellular material in 
relation to non-cancerous cellular material. In many cases, patient samples are a 
heterogeneous mixture of large amounts of normal cells and small amounts of 

15 cancerous cells. A good example of such a heterogeneous sample is stool. The typical 
stool sample contains cells and cellular debris sloughed from the colonic epithelium, by- 
products of digestion, and bacteria. In its early stages, colorectal cancer is thought to 
affect only about 1% of colonic epithelial cells. Any attempt to detect nucleic acids from 
the 1% of affected cells in the heterogeneous background of the stool sample might 

20 give rise to very low sensitivities. Attempts to identify the presence of the indicia of 
cancer in other heterogeneous samples, such as sputum, pus, urine, nipple aspirate, 
etc., presents similar problems. 

Recently, a number of genetic mutations have been associated with cancer. For 
example, alterations in the p53 gene, the Kras oncogene, and the ape tumor 

25 suppressor gene are thought to be participants in the multi-step pathway leading to 
cancer. It has been suggested that mutations in those genes might be a basis for 
molecular screening assays for the early stages of certain types of cancer. See e.g., 
Sidransky, et al., Science, 256: 102-105 (1992). Attempts have been made to identify 
and use nucleic acid markers that are indicative of cancer. However, even when such 

30 markers are found, using them to screen patient samples, especially heterogeneous 
samples, has proven unsuccessful either due to an inability to obtain sufficient sample 
material, or due to the low sensitivity that results from measuring only a single marker. 
For example, simply obtaining adequate human DNA from one type of heterogeneous 
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sample (stool) has proven difficult. See Villa, et al., Gastroenterol., 110: 1346-1353 
(1996) (reporting that only 44.7% of all stool specimens, and only 32.6% of stools from 
healthy individuals produced sufficient DNA for mutation analysis). Other reports in 
which adequate DNA has been obtained have reported low sensitivity in identifying a 
5 patient's disease status based upon a single cancer-associated mutation. See Eguchi, 
et al., Cancer, 77: 1707-1710 (1996) (using a p53 mutation as a marker for cancer). 

Accordingly, there is a need in the art for high-sensitivity, high-specificity assays 
for the detection of molecular indicia of cancer, especially in heterogeneous samples. 

SUMMARY OF THE INVENTION 

10 Methods of the invention solve the problem of obtaining accurate (high sensitivity 

and high specificity) results in an assay for indicia of cancer or precancer in a 
heterogeneous sample. 

The present invention provides assays conducted on samples obtained non- 
invasive^ or minimally-invasively for cancer or precancer in which the assays have a 

15 high sensitivity for detection of cancer or precancer when it is present in a patient 

sample, and a high specificity against false positive results. In a preferred embodiment, 
methods of the invention provide the benefits of high sensitivity and high specificity in 
an assay to detect a small amount of a cancer marker (e.g., a nucleic acid) in a 
heterogeneous sample having predominantly non-cancerous cells and cellular debris. 

20 Accordingly, such methods are especially useful for early detection of cancer or 
precancer Methods of the invention greatly increase the accuracy of molecular 
screening and diagnostic assays for the early detection of cancer or precancer. 

The present invention contemplates that one reason prior art non-invasive 
methods (i.e., methods conducted on samples obtained non-invasively or minimally 

25 invasively) for detecting molecular indicia (especially nucleic acid mutations) of cancer 
have failed to provide satisfactory results is that such methods have not addressed 
maintaining high specificity and/or high sensitivity in the assay. Such methods also fail 
to recognize the benefits of combining high sensitivity and high specificity features in an 
assay to detect early indicia of cancer, especially when the detection is performed in a 

30 heterogeneous sample. The present invention recognizes that screening assays for 
indicia of cancer, especially early-stage cancer (e.g., when cancer indicia represents 
about 1% of the cells and cellular debris in an appropriate sample as discussed below), 
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are improved by increasing the specificity and/or the sensitivity of the assay. As 
described below, the specificity and sensitivity of molecular assays can be improved in 
several ways. 

In a preferred embodiment, methods of the invention comprise selecting one or 
5 more nucleic acid target(s) that is (are) suspected to be mutated in cancer or 

precancer, identifying in a biological sample the presence of the one or more selected 
target(s), such that the probability of identifying as positive any sample comprising 
target is at least about 44%, and preferably about 50%, and most preferably between 
about 60% and about 95%, and the probability of identifying as negative a sample not 
io having any target is at least about 85%, and preferably between about 90% and about 
100%. 

In another preferred embodiment, the present invention provides methods for 
improving the sensitivity and specificity of molecular cancer screening assays by 
analyzing a plurality of target nucleic acids suspected to be mutated in cancer or 
15 precancer, such that the cumulative sensitivity of analyzing the plurality of targets is at 
least about 50%, and the cumulative specificity of analyzing the plurality of targets is at 
least about 85%. 

Methods of the invention also comprise selecting nucleic acid mutations for 
analysis in a cancer or precancer screening assay based upon the probability of 

20 occurrence of the mutations in cancer patients, and such that the probability of 

identifying at least one of the mutations in a patient sample having at least one of the 
mutations is between about 50% and about 95%. Methods of the invention also 
comprise selecting one or more nucleic acid mutations in order to optimize the 
sensitivity and specificity of a cancer or precancer screening assay, and conducting the 

25 assay in order to diagnose cancer or precancer. In a preferred embodiment, screening 
assays of the invention analyze at least two, and preferable between about 3 and about 
20 markers (e.g., mutations, loss of heterozygosity, sequence length variations, nucleic 
acid molecular weight variations, variations in amounts of amplifiable nucleic acid) in 
patient samples in order to improve the sensitivity and specificity of detection. Such 

30 "multiple target" assays allow improved sensitivity because the increased number of 
markers that are analyzed decreases the likelihood that a patient presenting with indicia 
of cancer or precancer will be misdiagnosed as negative (i.e., as not having cancer). 
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Methods of the invention further comprise selecting nucleic acid markers 
suspected to be indicative of cancer or precancer, validating the informativeness of the 
markers in order to select a subset of markers that provide high sensitivity and 
specificity in an assay for the detection of cancer or precancer in patient samples, and 

5 screening patient samples for the presence of the selected subset of markers. In a 
preferred embodiment, the nucleic acid markers are mutations known or suspected to 
be indicative of cancer or precancer. Also in a preferred embodiment, the validating 
step comprises determining the extent to which detection of a marker used in 
connection with a particular screening assay predicts whether a patient from whom the 

10 sample was obtained has cancer or precancer. Preferably, such validating methods 
are performed by comparing results from samples obtained from a first group of 
patients known to have cancer or precancer and those obtained from a second group 
known not to have cancer or precancer as determined by an accepted standard 
(typically invasive) method. 

15 Methods of the invention also comprise combining detection of a mutation known 

or suspected to be associated with cancer or precancer with detection of loss of 
heterozygosity at a relevant genomic locus. The combination of analyzing both 
mutations in cancer-associated nucleic acids, and loss of heterozygosity increases the 
specificity and sensitivity of any screening assay for cancer or precancer. In a preferred 

20 embodiment, such methods comprise validating selected loci for use in a screening 
assay such that the assay produces results with high sensitivity and high specificity. In 
a preferred embodiment, the sensitivity of the assay is at least 50%, and the specificity 
of the assay is at least 85%. 

Methods of the invention comprise combining two or more assays for molecular 

25 indicia of cancer or precancer in order to achieve a desired level of informativeness. 
For example, in a preferred embodiment, methods of the invention comprise combining 
two or more assays selected from quantitative PCR ( multiple mutation analysis, 
detection of loss of heterozygosity, and hybrid capture of one or more mutant nucleic 
acid markers. Accordingly, increased sensitivity and specificity is observed when an 

30 assay based upon the amount of amplifiable DNA is combined with an assay for a 
particular cancer-associated mutation. Examples of these "combination assays", and 
their resulting sensitivities and specificities, are provided in the detailed description. 
Such methods are especially useful when applied to a heterogeneous sample in which 
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the nucleic acid to be detected is present in a very small amount relative to other 
nucleic acids (as well as other molecules) in the sample. 

Methods of the invention also make use of the "informativeness" of 
markers used therein. The Informativeness of a nucleic acid marker relates to the 

5 likelihood of finding the marker in a positive sample. Thus, if a particular mutation, for 
example a mutation in codon 12 of K-ras, has an informativeness for cancer of 56%, 
this means that 56% of positive patient samples (i.e., those taken from patients who 
have cancer) have the K-ras mutation. Methods of the invention combine the use of 
informative markers (e.g., mutations) and high sensitivity/specificity assays in order to 

10 provide reliable screening assays for early diagnosis of cancer or precancer, especially 
in heterogeneous samples. For purposes of the present invention a mutation is a 
deletion, addition, substitution, rearrangement, or translocation in a nucleic acid. A loss 
of heterozygosity is a form of mutation in which all or a portion of one allele is deleted. 
Also for purposes of the present invention, the terms "markers", "targets", and 

15 "mutations" include nucleic acid (especially DNA) mutations (substitutions, additions, 
rearrangements, translocations, deletions, etc.), as well as other nucleic acid indicia 
useful in methods of the invention. Such indicia include the amount of amplifiable 
nucleic acid in a sample, the length of nucleic acids in a sample, the ratio of long 
nucleic acids (greater than about 200 bp) to short nucleic acids (less than about 200 

20 bp), and any other nucleic acid variations that differ in patients with cancer and disease- 
free patients. Also for purposes of the present invention, the terms "healthy" or 
"disease-free" are intended to mean a patient who does not have cancer or precancer. 

Stool is a good example of a heterogeneous sample in which methods of the 
invention are especially useful. A typical stool sample contains patient nucleic acids, 

25 but also contains heterologous nucleic acids, proteins, and other cellular debris 

consistent with the lytic function of the various nucleases, proteinases, etc. found in the 
colon. As stool proceeds from the proximal colon to the distal colon, it (under normal 
circumstances) solidifies. As the solidifying stool passes through the colon, colonic 
epithelial cells are sloughed onto the stool. If a patient has a developing tumor or 

30 adenoma, cells from the tumor or adenoma will also be sloughed onto stool, and they 
(or their debris) will contain molecular indicia of disease (e.g., mutations or loss of 
heterozygosity). In the early stages of development, nucleic acid indicative of an 
adenoma or tumor comprise only about 1% of the nucleic acid in a voided stool. If a 
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patient is left untreated, proportionately more disease-related nucleic acids are found in 
stool over time. Methods of the invention are useful for detecting early-stage lesions in 
heterogeneous samples such as stool. Methods of the invention result in a high degree 
of sensitivity and specificity for the detection of early-stage disease. Methods of the 
5 invention are especially useful in detecting, for example, adenomas in the colon. 

Adenomas are non-metastatic lesions that frequently have the potential for metastasis. 
If all adenomas in a patient are detected and removed, the probability of complete cure 
is virtually certain. 

In a preferred embodiment, nucleic acids or nucleic acid mutations having a high 

10 degree of informativeness are chosen. One or more assay(s) is (are) conducted to 
reliably detect one or more informative nucleic acids, and a diagnosis is made based 
upon the presence in a patient sample of any one of the informative nucleic acids. In a 
preferred embodiment, nucleic acids are chosen as targets for analysis in methods of 
the invention based upon their length and/or sequence characteristics. For example, it 

15 has now been discovered that the quantity and/or length of nucleic acids in stool (a 
prototypical heterogeneous sample) presents a high degree of informativeness 
regarding a patient's disease status. Patients having, for example, an adenoma 
produce stool specimens containing more and longer DNA than specimens produced 
by healthy patients. Moreover, a number of highly-informative DNA mutations are 

20 useful in methods of the invention. These include, mutations in the oncogene, Kras 
(especially mutations at codons 12 and 13); mutations in the cell-cycle regulator, p53; 
mutations in the ape gene; mutations in the bat-26 segment of the MSH2 mismatch 
repair gene; and loss of heterozygosity (typically indicated by massive loss of DNA in 
one allele but not the other). Finally, methods of the invention provide that the 

25 informativeness of an assay is increased by screening multiple mutations 
simultaneously or in sequence, or by combining different assays. 

In another preferred embodiment, methods of the invention provide informative 
molecular assays for cancer or precancer by providing samples for analysis that have 
sufficient amplifiable nucleic acid. Thus, in one embodiment, methods of the invention 

30 comprise screening samples for amplifiable DNA; classifying samples based upon the 
amount of DNA capable of being amplified from them; and further screening samples 
with a predetermined threshold of amplified or amplifiable DNA for the presence of a 
mutation indicative of cancer or precancer. 
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ln another preferred embodiment, methods of the invention comprise selecting 
one or more mutational events that are indicative of cancer or precancer, such that the 
combined informativeness of the one or more events meets or exceeds a 
predetermined or desired level of informativeness. The informativeness of any 

5 mutation or combination of mutations may be validated by an accepted invasive 
screening technique. For example, in methods to detect colorectal cancer, the 
informativeness of a molecular assay may be determined by identification of a lesion 
using colonoscopy, 

A detailed description of certain preferred embodiments of the invention is 

io provided below. Other embodiments of the invention are apparent upon review of the 
detailed description that follows. 

DETAILED DESCRIPTION OF THE DRAWINGS 

Figure 1 is a polyacrylamide gel showing exemplary "A", "B", "C", and 41 F" 
amplifications for use in assays to determine an amount of amplifiable nucleic acid in a 
15 sample. Lanes 1 , 4, 5, 6, and 7 show an "A" amplification, lane 2 shows a "C" 

application, lane 3 shows a "B n amplification, and lane 8 shows an "F" amplification (no 
amplifiable DNA). 
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DETAILED DESCRIPTION OF THE INVENTION 

Methods of the invention provide non-invasive or minimally-invasive assays for 
the detection of cancer or precancer at early stages of disease. Methods of the 
invention are especially useful in detecting cancer or precancer in heterogeneous 
5 biological samples. Preferred methods comprise identifying in a patient sample one or 
more nucleic acid mutations(s) that provide high sensitivity and high specificity for 
detection of the indicia of cancer or precancer. Methods of the invention may comprise 
identifying mutations having a known informativeness for cancer or precancer, or may 
be based upon validating selected mutations or assays to detect them with respect to a 

10 standard assay for cancer. By utilizing cancer or precancer markers having a high 
sensitivity/specificity for detecting the presence of cancer or precancer, methods of the 
invention provide improvements in non-invasive or minimally-invasive molecular 
screening assays. For purposes of the present invention, non-invasive or minimally- 
invasive means that specimens for analysis are obtained either from bodily excretions 

15 (e.g. stool, pus, sputum) or bodily fluids such as blood aspirate, or lymph. 

The invention will be exemplified with experiments to detect the presence of 
indicia of colorectal cancer or precancer in samples prepared from patient stool 
specimens. However, the skilled artisan recognizes that methods of the invention can 
be practiced using a variety of different samples in order to detect a variety of cancers. 

20 A reason that detection of colorectal cancer or precancer (e.g., an adenoma) is 

exemplified is that a stool specimen is a good example of a heterogeneous 
environment in which methods of the invention are especially useful (see above). 
Moreover, colonoscopy (and sigmoidoscopy, a related technique) is a well-known 
invasive standard that has a high sensitivity and high specificity (although high cost and 

25 low patient compliance) with which methods of the invention can be compared and 
validated. 

Methods of the invention comprise screening a sample, such as one prepared 
from a stool specimen, for the presence of one or more marker(s) of cancer or 
precancer (e.g., a colorectal tumor or adenoma), such that the sensitivity of detection is 
30 between about 50% and about 100%, and the specificity of detection is between about 
85% and about 100%. In a preferred embodiment, methods of the invention combine 
different types of assays in order to achieve an overall increase in sensitivity and 
specificity. Thus, methods of the invention comprise conducting an assay for a 
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mutation known to be associated with cancer or precancer, and an assay for a quantity 
and/or length of DNA expected to occur in cancer or precancer in order to obtain the 
combined benefits of the sensitivity and specificity of both assays. Moreover, 
embedded within the concept of utilizing multiple nucleic acid analyses to detect cancer 

5 or precancer is the use of multiple genomic targets in each assay in order to provide 
further increases in sensitivity and specificity. However, as shown below, a single- 
marker assay is sufficient for practice of the invention if its sensitivity and specificity are 
within the ranges taught herein. 

The genomic targets and assay methods used according to the invention can 

10 vary depending upon the desired level of sensitivity and specificity, as well as the type 
of cancer or precancer the detection of which is desired. Genomic targets (e.g., 
mutations) are selected based upon their known sensitivity or specificity or by 
determining a baseline sensitivity and specificity. In preferred embodiments, methods 
of the invention comprise the detection of a mutation at a single, informative locus. In 

15 other embodiments, assays for informative loci are combined in order to achieve 
improved sensitivity and specificity of detection relative to invasive techniques. 
Accordingly, methods of the invention contemplate a combination of assays selected 
from multiple mutation detection, quantitative polymerase chain reaction (i.e., to 
determine the amount of amplifiable DNA in a sample), sequence-specific hybrid 

20 capture, oligo-ligation, amplification refractory mutation system, single-stranded 

conformational polymorphism detection, sequencing, mismatch detection, and single 
base extension. Target loci include chromosomes 1, 5, 8, 17, and 18, particularly 
chromosome 5q, chromosome 17p, chromosome 8p, chromosome 1q, and 
chromosome 18q. Preferred loci for use in methods of the invention include p53, ape, 

25 bat-26, and others suspected to be predictive of cancer or precancer. 

Other genes are known to be associated with colorectal cancer, and their 
sensitivity and specificity are determined when not known in the literature by 
determining the percentage of tumors bearing the mutation, and the percentage of 
healthy specimens that bear the mutation from a sufficiently large and diverse 

30 population. This can be done empirically, or mathematically using algorithms that 
predict the likelihood of false positive and false negative screening results based upon 
data relating the presence of a mutation to the presence of cancer or precancer. 
Confirmation of a patient's clinical status can be accomplished by a standard test such 
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as colonoscopy in the case of colorectal cancer (which has a typical sensitivity of 95% 
and a typical specificity of 100%). 

For the analysis of stool samples, preferred methods of the invention comprise 
obtaining at least a cross-section or circumfrential portion of a voided stool as taught in 

5 U.S. patent number 5,741,650, and co-pending, co-owned U.S. patent application serial 
number 09/059,718, both of which are incorporated by reference herein. While a cross- 
sectional or circumfrential portion of stool is desirable, methods provided herein are 
conducted on random samples obtained from voided stool, which include smears or 
scrapings. Once obtained, the stool specimen is homogenized. A preferable buffer for 

10 homogenization is one that contains at least 16mM ethylenediaminetetraacetic acid 
(EDTA). However, as taught in co-pending, co-owned U.S. patent application serial 
number 60/122,177, incorporated by reference herein, it has been discovered that the 
use of at least 150mM EDTA greatly improves the yield of nucleic acid from stool. 
Thus, a preferred buffer for stool homogenization comprises phosphate buffered saline, 

15 20-100 mM NaCI or KCI, at least 150mM EDTA, and optionally a detergent (such as 
SDS) and a proteinase (e.g., proteinase K). 

After homogenization, nucleic acid is preferably isolated from the stool sample. 
Isolation or extraction of nucleic acid is not required in all methods of the invention, as 
certain detection techniques can be adequately performed in homogenized stool 

20 without isolation of nucleic acids. In a preferred embodiment, however, homogenized 
stool is spun to create a supernatant containing nucleic acids, proteins, lipids, and other 
cellular debris. The supernatant is treated with a detergent and proteinase to degrade 
protein, and the nucleic acid is phenol-chloroform extracted. The extracted nucleic 
acids are then precipitated with alcohol. Other techniques can be used to isolate 

25 nucleic acid from the sample. Such techniques include hybrid capture, and 

amplification directly from the homogenized stool. Nucleic acids can be purified and/or 
isolated to the extent required by the screening assay to be employed. 

Nucleic acids to be analyzed are chosen based upon known or suspected 
relationships between specific mutations and cancer or precancer. If desired, 

30 sequence-specific hybrid capture is used to isolate specific nucleic acids from the 

sample. Target nucleic acids may be analyzed by any method of the art. Examples of 
preferred methods include enumerative analysis of the loss of heterozygosity as taught 
in U.S. patent number 5,670,325, incorporated by reference herein. Enumerative 
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methods do not require knowledge of the sequence of a mutant nucleic acid. Rather 
such methods determine that there has been an alteration (deletion, substitution, 
addition, rearrangement, or other mutation) in a wild-type nucleic acid. The investigated 
loci are chosen based upon the likelihood of an alteration being associated with cancer 

5 or precancer. Enumerative methods compare the number in a sample of a wild-type 
nucleic acid known not to be altered in cancer or precancer with the number of a wild- 
type nucleic acid known or suspected to be altered in cancer or precancer. A 
statistically-significant difference in the two numbers indicates a positive screen. 

Mutations in target nucleic acids may also be measured by single base extension 

10 techniques to identify a single nucleotide variant indicative of cancer or precancer. 
Preferably, single base extension assay are cycled as taught in co-owned, co-pending 
U.S. patent application serial number 09/067,212, incorporated by reference herein. 
Briefly, cycled single base extension reactions comprise annealing a nucleic acid primer 
immediately 5' to a region containing a single base to be detected. The single base to 

is be detected represents a marker for mutation. The mutation may be a single point 
mutation or may be a larger mutation for which the single base is a marker. Two 
separate reactions are conducted. In the first reaction, primer is annealed to target, 
and labeled (preferably 32 P) nucleic acids complementary to non-wild type (e.g. mutants 
indicative of disease) variants at the single base to be detected, and unlabeled dideoxy 

20 nucleic acids complementary to the wild-type base are combined. Primer extension is 
stopped the first time a wild-type (dideoxy) base is added to the primer. Presence of 
label in the extended primer is indicative of the presence of a mutation. A second tube, 
the positive control contains labeled nucleic acid complementary to the wild-type base 
in the presence of the primer. A DNA polymerase, such as Sequenase™ (Amersham), 

25 is used for primer extension. In a preferred embodiment, a thermostable polymerase, 
such as Taq or thermal sequenase is used to allow more efficient cycling. Once an 
extension reaction is completed, the first and second probes bound to target nucleic 
acids are dissociated by heating the reaction mixture above the melting temperature of 
the hybrids. The reaction mixture is then cooled below the melting temperature of the 

30 hybrids and additional primer is permitted to associate with target nucleic acids for 
another round of extension reactions. In a preferred embodiment, 10 to 50 cycles of 
extension reactions are conducted. In a most preferred embodiment, 30 cycles of 
extension reactions are conducted. After completion of all cycles, extension products 
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are isolated and detected. In alternative embodiments, chain-terminating methods 
other than dideoxy nucleotides may be used. For example, chain termination occurs 
when no additional bases are available for incorporation at the next available nucleotide 
on the primer. 

5 Methods of the invention are also useful for screening populations of patients in 

order to identify characteristics in population samples that are indicative of cancer or 
adenoma. For example, methods of the invention comprise high sensitivity, high 
specificity screening of populations of patients in order to correlate nucleic acid 
mutations present in a subset of patient samples with the presence of disease in those 

10 patients. Thus, methods of the invention comprise detecting genomic variations in 
patient samples, correlating those variations with confirmed disease, and using the 
variations associated with confirmed disease as a diagnostic screen for the disease in 
subsequent patient samples. Such methods preferably are performed on pooled 
samples, such as stool samples, from identified populations of patients (e.g., diseased, 

15 healthy). Such methods are preferably based upon variations in single nucleotide 
polymorphic loci. The sensitivity and specificity of detecting variants in those loci as a 
function of disease is determined. Those loci that predict disease at predefined levels 
of sensitivity and specificity are selected for use in screening assays for unknown 
patient samples. 

20 The following examples provide specific exemplification of the concepts 

discussed above. The examples utilize a subset of mutational events that are shown to 
be predictive of disease. Other mutations are contemplated to function as high 
sensitivity, high specificity diagnostic or screening markers in assays of the invention. 
Moreover, the assays exemplified below are for purposes of illustration. The invention 

25 contemplates a variety of assays useful to screen patient samples for cancer or 
precancer as long as the assays provide a predetermined level of sensitivity and 
specificity of at least about 44% and at least about 85%, respectively. 

EXAMPLE 1 : Multiplex Screening of Stool Samples 

An experiment was conducted to determine the effects of multiple mutation 
30 analysis alone on the sensitivity and specificity of cancer or precancer detection in the 
stool specimens described above. Fifteen mutations, all known or suspected to occur 
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in colorectal cancer or precancer were used to screen the 40 patient samples. The 
following table catalogs the mutations that were assayed. 

TABLE 1 



Gene 


Mutation 


Kras, codon 13 


Position 2, G to A 


Kras, codon 1 2 


Position I.Gto A 


Kras, codon 12 


Position 2, G to A 


Ape, codon 1450 


Position l.CtoT 


Ape, codon 1378 


Position l.CtoT 


Ape, codon 1367 


Position I,CtoT 


Ape, codon 1309 


Deletion of 5 base pairs 


P53, codon 175 


Position 2, G to tA 


P53, codon 273 


Position l,CtoT 


P53, codon 273 


Position 2, G to A 


P53, codon 282 


Position l,CtoT 


P53, codon 245 


Position 1, G to A 


P53, codon 245 


Position 2, G to A 


P53, codon 248 


Position l,CtoT 


P53, codon 248 


Position 2, G to A 



According to methods of the invention, multiple mutation analysis is a preferred 

io means for increasing the sensitivity and specificity of cancer or precancer detection. 
For example, there are cumulative benefits of combining the informativeness (see 
above) of a mutation at one allele (e.g., Kras, codon 13, position 2) with the 
informativeness of a second allele (e.g., Kras codon 12, position 1, or ape codon 1450, 
position 1) in order to increase the overall sensitivity and specificity of the assay. 

15 Accordingly, the benefits of one aspect of the invention are presented below. 

Stool specimens were collected from 40 individuals who presented at the Mayo 
Clinic (Rochester, MN) with symptoms or history indicating that a colonoscopy should 
be performed. Each stool sample was frozen. Immediately after providing a stool 
sample, all individuals were given a colonoscopy in order to determine their disease 

20 status. Colonoscopy, an invasive test requiring sedation of the patient, has a sensitivity 
approaching 95%, and a specificity of nearly 100% for the diagnosis of colonic 
neoplasia. Based upon the colonoscopy results and subsequent histological analysis of 
biopsy samples taken during colonoscopy, individuals were placed into one of three 
groups: normal, cancer, and adenoma. An adenoma, or polyp, is considered clinically 

25 relevant if it has a diameter of 1 cm or greater. Thus, all individuals in the adenoma 
group had a polyp of at least 1 cm in diameter. Patients in the cancer group had 
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tumors diagnosed as cancer, and the disease-free individuals were those for whom 
colonoscopy showed no sign of cancer or adenoma. Based upon the colonoscopy 
results, 21 patients were diagnosed with cancer, 9 patients were diagnosed with an 
adenoma greater than 1 cm, and 10 patients were free of cancer or adenoma. 

5 Multiple mutation analysis was then performed, on a blinded basis (i.e., scientists 

performing the assays did not know the results of colonoscopy or histology), on each 
sample. Each frozen stool specimen, weighing from 7-33 grams, was thawed, 
homogenized in 500 mM Tris ( 16 mM EDTA, and 10 mM NaCI, pH 9.0, at a volume to 
mass ratio of about 3:1 . Samples were then rehomogenized in the same buffer to a 

10 final volume-to-mass ratio of 20:1, and spun in glass macro beads at 2356 x g. The 
supernatant was collected and treated with SDS and proteinase k. The DNA was then 
phenol-chloroform extracted and precipitated with alcohol. The precipitate was 
suspended in 10 mM Tris and 1 mM EDTA (1 x TE), pH 7.4. Finally, the DNA was 
treated with Rnase. 

15 Human DNA was isolated from the precipitate by sequence-specific hybrid 

capture. Biotynilated probes against portions of the p53, K-ras, and ape genes were 
used. The K-ras probe was S'GTGGAGTATTTGATAGTGTATTAACCTTATGTGTGAC 
3* (SEQ ID NO: 1). There were two ape probes: apc-1309 was 
5'TTCCAGCAGTGTCACAGCACCCTAGAACCAAATCCAG 3' (SEQ ID NO: 2), and 

20 apc-1 378 was 5'CAGATAGCCCTGGACAAACAATGCCACGAAGCAGAAG 3' (SEQ ID 
NO: 3). There were four probes against p53, the first (hybridizing to a portion of exon 
5) was 5TACTCCCCTGCCCTCAACAAGATGT^TTGCCAACTGG3 , (SEQ ID NO:4), 
the second (hybridizing to a portion of exon 7) was 

5'ATTTCTTCCATACTACTACCCATCGACCTCTCATC3' (SEQ ID NO: 5), the third, 
25 also hybridizing to a portion of exon 7 was 

5 , ATGAGGCCAGTGCGCCTTGGGGAGACCTGTGGCAAGC3 , (SEQ ID NO: 6); and 
finally, a probe against exon 8 had the sequence 

5'GAAAGGACMGGGTGGTTGGGAGTAGATGGAGCCTGG3' (SEQ ID NO: 7). A 10 
ul aliquot of each probe (20 pmol/capture) was added to a suspension containing 300 ul 
30 DNA in the presence of 310ul 6M GITC buffer for 2 hours at room temperature. Hybrid 
complexes were isolated using streptavidin-coated beads (Dynal). After washing, 
probe-bead complexes were suspended at 25° C for 1 hour in 0.1x TE buffer, pH7.4. 
The suspension was then heated for 4 minutes at 85° C, and the beads were removed. 
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Captured DNA was then amplified using PCR, essentially as described in U.S. 
Patent No. 4,683,202, incorporated by reference herein. Seven separate PCRs were 
run in duplicate using primers directed against Kras, ape, and p53. The primers listed 
below were used: 



PPR-A FOR 

rwrl-n run 


i\ras 


fZCfi fi.TC PfA A A A nnn TP A CTC CTC PTP A A a at^ apt r> a a rii 
0 oUb o 1 U UCA AAA bbb 1 OA G I C C 1 G C 1 G AAA ATG ACT GAA 3 

(SEQ ID NO: 8) 


39 






C 1 /OirttlnXft^rt P.TP CCA AAA P.P.P. TCA flTC ATP AAA ATP PTP a^a a a 

0 ^oiOUnjoOo o 1 U UOA AAA obb 1 CA u 1 C A 1 G AAA ATG GTC AGA GAA 
A3 1 (SEQ ID NO: 9) 


40 




Apr* i*ino 


C' Op/J AT/^ PPT III' PPP TC A rTT PTA PTT P AT TA*r hat -i- hi 

D bob o 1 C GC 1 1 1 1 boo 1 OA oTT GT A GTT CAT TAT CAT CTT T 3 
(SEQ ID NO: 10) 


40 


r On D*r\C V 




*5* ftlir\tirs\nCfi flTC flCT ITT CCC TP A CTC TTP PPT a /"» a «t/h i— 

O iDlOtinjolso o 1 0 bu 1 1 1 1 bbb 1 CA G I C ITC GCT CAC AGG ATC TTC 
A3* (SEQ ID NO: 11) 


40 




ADC <Q7fl 


C* PPP PTP PPA AAA PPP APA PT A PPP APA a * /"*» r^T*^ ^ Y /•-» a a t- «i 

o GCG GTC GCA AAA GGG AC A GTA GGC AC A AAG CTG TTG AAT 3 
(SEQ ID NO: 12) 


39 


PCR-C-REV 




5* (Biotin)GCG GTC GCA AAA GGG ACA GTT ATC AAG TGA ACT GAC AGA 
A3' (SEQ ID NO: 13) 


41 


PCR-D-FOR 


APC-1450 


5' GCG GTC CCA AAA GGG TCA GTC ACC TCC ACC ACC TCC TCA A 3' 
(SEQ ID NO: 14) 


40 


PCR-D-REV 




5* (Biotln)GCG GTC CCA AAA GGG TCA GTG TAT CAG CAT CTG GAA GAA 
3' (SEQ ID NO: 15) 


39 


PCR-E-FOR 


p53 Exon 5 


5* GCG GTC CCA AAA GGG TCA GTC CAT CTA CAA GCA GTC A 3' (SEQ 
ID NO: 16) 


37 


PCR-E-REV 




5* (Biotin)GCG GTC CCA AM GGG TCA GTC AGA CCT AAG AGC AAT CA 
3' (SEQ ID NO: 17) 


38 


PCR-F-FOR 


p53 Exon 7 


5* GCG GTC CCA AAA GGG TCA GAT ACC ACC ATC CAC TAC AA 3* (SEQ 
ID NO: 18) 


38 


PCR-F-REV 




5' (Biotin)GCG GTC CCA AAA GGG TCA GAG TAT GGA AGA AAT CGG TAA 
3* (SEQ ID NO: 19) 


39 


PCR-G-FOR 


p53 Exon 8 


5' GCG GTC CCT TTT GGG TCA CTC TGC CTC TTG CTT CTC TTTT 3' 
(SEQ ID NO: 20) 


40 


PCR-G-REV 




5' (Biotin)GCG GTC CCT TTT GGG TCA CTC TTG TCC TGC TTG CTT ACC T 
3' (SEQ ID NO: 21) 


40 



. Samples were heated to 94° C for 5 minutes, and then 40 cycles were 
conducted between 94° C, 60° C, and 72° C (1 minute each), followed by one cycle at 
72° C for 5 minutes. 

The presence of the 15 mutations listed in Table 1 above was determined by 
cycling single base extension (cycling SBE), essentially as described above and in co- 
pending, co-owned U.S. patent application serial number 09/067,212, incorporated by 
reference herein. Briefly, two reactions were run. In the first reaction, primer was 
hybridized adjacent the single base to be detected. 32 P-labeled nucleotide 
complementary to the expected mutant base and unlabeled dideoxy nucleotide 
complementary to the wild-type base were added. Primer was extended, and the 
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presence of labeled product indicated a mutation was present in the sample. A second 
reaction was run as a positive control in which only labeled wild-type complement was 
added to the reaction mixture. Primer extension incorporating the labeled base assured 
that the reaction was running properly. 
5 Primers used in SBE were as follows: 



Name 


Site 


Codon/Position 


S« 


SBE-A1 


Kras 


kl2p.l 


5' 


SBE-A2 


Kras 


k!2p.2 


5' 


SBE-A3 


Kras 


k!3p.2 


5' 


SBE-B1 


APC-1309 


1309 C 5) 


5* 


SBE-C! 


APC-1378 


1367 


5' 


SBE-C2 


APC-1378 


1378 


5' 


SBE-D1 


APC-1450 


1450 


5' 


SBE-E1 


p53 Exon 5 


175p.2 


5' 


SBE-F1 


p53 Exon 7 


245p.l 


5' 


SBE-F2 


p53 Exon 7 


245p.2 


5' 


SBE-F3 


p53 Exon 7 


248p.l 


5* 


SBE-F4 


p53 Exon 7 


248p.2 


y 


SBE-G1 


p53 Exon 8 


273p.l 


5 f 


SBE-G2 


p53 Exon 8 


273p.2 


5' 


SBE-G3 


p53 Exon 8 


282p.l 


5' 



Sequence 

AAC TTG TGG TAG TTG GAG CT 3' (SEQ ID NO: 22) 
ACT TGT GGT AGT TGG AGC TG 3* (SEQ ID NO: 23) 
TGT GGT AGT TGG AGC TGG TG 3* (SEQ ID NO: 24) 
AAA TAG CAG AAA TAA AA 3' (SEQ ID NO: 25) 
CTC CCT CCA AAA GTG GTG CT 3* (SEQ ID NO: 26) 
GTC CAC CTG TAC ACT ATG TT 3* (SEQ ID NO: 27) 
CTC AAA CAG CAC AAA CCA AG 3' (SEQ ID NO: 28) 
CAT GAC GGA GGT TGT GAG GC 3' (SEQ ID NO: 29) 
GTA ACA GTT CCT GCA TGG GC 3' (SEQ ID NO: 30) 
TAA CAG TTC CTG CAT GGG CG 3* (SEQ ID NO: 31) 
CCT GCA TGG GCG GCA TGA AC 3* (SEQ ID NO: 32) 
CTG CAT GGG CGG CAT GAA CC 3' (SEQ ID NO: 33) 
GAC GGA ACA GCT TTG AGG TG 3' (SEQ ID NO: 34) 
ACG GAA CAG CTT TGA GGT GC 3' (SEQ ID NO: 35) 
GTG CCT ATC CTG GGA GAG AC 3' (SEQ ID NO: 36) 



Reactions were performed under standard denaturation, annealing, extension 
cycling for 30 cycles and visualized on a 15% denaturing polyacrylamide gel. Counts 
10 per minute (CPM) from each cycling reaction were entered into a Packard Instant 
Imager (wire chamber counter). Percent allele heterogeneity was determined as: 

cpm mutant / cpm wild type 

cmp 1% mutant Ctrl / cpm 1% wild-type Ctrl 

A positive sample was defined as one in which at least one of the two replicates 
15 possessed a mutation with 1 % heterogeneity for at least one of the single bases that 
was analyzed. Any sample in which at least one of the genes analyzed showed a 
mutation in duplicate was considered positive. The results are summarized below in 
Table 2. The total numbers of patients in the cancer/adenoma and normal groups are 
shown under the column "patient status", 
20 TABLE 2 



Patient Status 


Lesions 
Detected By 
Colonoscopy 


Lesions 
Detected By 
SBE 


Sensitivity of 
SBE 


Specificity of 
SBE 


Cancer/Adenoma (30) 


21/9 


11/4 


52%/44% 


100%/100% 


Normal (10) 


0 


0 
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As shown in Table 2, multiple mutation analysis using SBE correctly identified 1 1 
out of 21 cancerous lesions identified by colonoscopy for a sensitivity of 52%. Multiple 
mutation analysis revealed 4 out of 9 adenomas for a sensitivity of 44%. In both cases, 
multiple mutation analysis using SBE correctly identified all disease-free individuals, 
5 resulting in no false positives (specificity of 100%). 

A fecal occult blood test, was run in parallel with the SBE test on all samples 
from patients diagnosed with an adenoma. Fecal occult blood testing failed to 
diagnosis any of the 9 adenoma-positive samples, and thus had a sensitivity of 0%. 
Accordingly, multiple mutation analysis has a far greater sensitivity and specificity than 
io the most common non-invasive technique currently available (fecal occult blood). 

Example 2 - Quantitative DNA Analysis 

In this experiment, the same 40 samples described in Example 1 were 
independently analyzed for their overall content of DNA in order to determine if the 
amount of amplifiable DNA in stools produced by individuals with cancer or precancer 
15 was different than the amount of amplifiable DNA produced in stools from cancer-free 
individuals. Samples were analyzed "blind", and later correlated to colonoscopy results 
as described below. 

Aliquots of the DNA obtained from the 40 patients described above in Example 1 
were amplified using the primers described above. Each sample was amplified through 

20 7 loci in duplicate (for a total of 14 amplifications for each locus). The products of PCR 
were placed on a 4% Nusieve (FMC Biochemical) gel (3% Nusieve, 1% agarose), and 
stained with ethidium bromide (0.5 ug/ml). The resulting amplified DNA was graded 
based upon the relative intensity of the stained gels. An "A" amplification produced the 
greatest cumulative intensity (and hence the greatest amount of DNA) after 40 cycles of 

25 PCR, "B n and U C amplifications produced proportionately less gel intensity; and tt F n 
amplifications produced no or little intensity. Figure 1 shows exemplary A, B, C, and F 
amplifications. There is sufficient reproducibility in PCR to allow the skilled artisan to 
classify an amount of amplifiable nucleic acid based upon standards for healthy and 
cancer populations, or based upon inspection of the gel photograph in Figure 1. The 

30 assay only requires that one differentiate U A" amplifications from any of tt B", U C", and "F" 
amplifications. An M A" amplification is one that has a band intensity of that in lane 1 of 
Figure 1 , or to another gel band of similar intensity (e.g., lanes 4, 5, 6, or 7 of Figure 1). 
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The markers to the right of Figure 1 show exemplary amounts of DNA giving rise to A, 
B, and C amplifications. Thus 200 pg of DNA (lane 12 in the Figure) results in an "A" 
amplification, and 100 pg (lane 13 in the Figure) results in a "B" amplification, and 50 pg 
(lane 14 in the Figure) results in a "C" amplification. 
5 Any DNA sample that produced 9 "A" amplifications out of a possible 14 was 

graded as positive for cancer or adenoma. The results are shown in Table 3 below: 
TABLE 3 



Patient Status 


Number of 
Patients 
Determined By 
Colonoscopy 


Number of 
Patients 
Determined By 
Quantitative 
DNA Analysis 
("A" Amps) 


Sensitivity of 
Quantitative 
DNA Analysis 


Specificity of 
Quantitative 
DNA Analysis 


Cancer/Adenoma 
(30) 


21/9 


14/5 


67%/56% 


100%/100% 


Cancer-free (10) 


0 


0 







As shown in Table 3, the amount of amplifiable DNA in stool is highly predictive 
10 of a patient's disease status. These results are consistent with the idea that patients 
with cancer or adenoma in the colon slough more cells (and therefore more DNA) onto 
the forming stool. Moreover, The DNA derived from cancer or adenoma cells is more 
intact than DNA derived from normal cells since cancer and adenoma cells have 
avoided apoptotic degradation of DNA. 

15 Example 3 - Combined Multiple Mutation and Quantitative Analysis 

The results obtained in Examples 1 and 2 above were combined to determine if 
further increases in sensitivity and specificity would be observed. 

A positive sample in this experiment was one which produced an "A w 
amplification, and produced a positive multiple mutation result (under the criteria 
20 described in Example 1). Samples were prepared and analyzed on a "blind" basis (i.e., 
without knowing colonoscopy results a priori) as described above. The results are 
shown in Table 4 below: 
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TABLE 4 



Patient Status 


Number of 
Patients 
Diagnosed By 
Colonoscopy 


Number of 
Patients 
Diagnosed By a 
Combination of 
Quant. And 
Multiple Mutation 


Sensitivity 


Specificity 


Cancer/Adenoma 


21/9 


16/7 


76%/78% 


100%/100% 



The results of the combined quantitative and multiple mutation analyses show a 
5 sensitivity of detection of 76% for cancer, and 78% for adenomas, each having a 

specificity of 100%. These results far exceed those of other non-invasive or minimally- 
invasive techniques (e.g., fecal occult blood testing which has a sensitivity of 0). 

Example 4 - Diagnostic Assay Using Bat-26 

The Bat-26 mismatch repair locus (shown in SEQ ID NO: 37) was next used to 
io assess the same 40 samples described above. Deletions in Bat-26 have been 

associated with colorectal cancer or adenomas. Samples were prepared as described 
above. A primer was hybridized to the portion of the Bat-26 locus immediately 
upstream of the poly-A tract. Unlabeled deoxy thymidine, a mixture of labeled and 
unlabeled deoxycytosine, and unlabeled dideoxy adenine were added along with 
15 polymerase. The primer was extended through the poly-A region. The labeled and 
unlabelled cytosine was extended for the next three bases (nucleotides 222-224, all 
guanines in the intact sequence) such that label was incorporated into each extended 
primer. After the poly-A tract and the three guanines, there exist two thymidines in the 
intact sequence. Thus, the dideoxy adenosine stops primer extension by addition at 
20 the end of a primer that has been extended through the poly-A and triguanine regions. 
Strands were separated, and the length of the strands was observed on a 
polyacrylamide gel to detect deletions in the poly-A tract. The results are presented 
below in Table 5; 
TABLE 5 



Patient Status 


Diagnosis By 
Colonoscopy 


Diagnosis By 
Bat-26 Detection 


Sensitivity of 
Bat-26 Detection 


Specificity of 
Bat-26 Detection 


Cancer/Adenoma 


21/9 


4/0 


19%/0% 


100%/0% 



As shown above, Bat-26 alone did not provide the high sensitivity achieved using 
multiple mutation or quantitation alone, but showed high sensitivity in comparison with 
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other single locus detection assays. Moreover, as shown below, Bat-26 in combination 
with the other techniques described above produced an overall increase in sensitivity 
and specificity. 

Example 5 - Cumulative Effects of Kras, Multiple Mutation, Quantitation, 
s and BAT-26 

The results obtained above for Kras, multiple mutation analysis, quantitation, and 
Bat-26 were combined to determine the cumulative effects of using combinations of 
those techniques in order to produce increased sensitivity and specificity in a non- 
invasive assay for cancer or precancer. The results are summarized below in Table 6: 



TABLE 6 



Assay Combination 


Kras and Quantitation 
and BAT-26 


Quantitation and 
BAT-26 


Multiple Mutation and 
Quantitation and 
BAT-26 


Sensitivity for 
Detection of 
Cancer/Adenoma 


80%/56% 


80%/56% 


90%/78% 


Specificity for 
Detection of 
Cancer/Adenoma 


100% 


100% 


100% 



As shown in the summary above, the combination of multiple mutation analysis, 
quantitative PCR, and Bat-26 produced a sensitivity approaching that of colonoscopy. 
15 A combination of multiple mutation analysis and quantitation alone also produces very 
high sensitivities. All assays resulted in a specificity of 100% (no false positive results), 
which is comparable to colonoscopy. 

The foregoing experiments show that even a single high-sensitivity/high 
specificity non-invasive or minimally-invasive assay produces diagnostic results that are 
20 superior to non-invasive/minimally-invasive techniques of the art, and approach results 
observed with the recognized standard invasive diagnostic procedure (colonoscopy). 
Moreover, a non-invasive assay utilizing more than one high-sensitivity/high-specificity 
technique results in diagnostic accuracy approaching 100%. As such methods of the 
invention provide a significant improvement in the ability to perform accurate non- 
25 invasive diagnosis of cancer. 
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CLAIMS 

What is claimed is: 

1 1 . A method for detecting a nucleic acid marker indicative of cancer or precancer in a 

2 tissue or body fluid sample, the method comprising the steps of: 

3 (a) selecting one or more informative nucleic acid markers; and 

4 (b) conducting in a sample obtained non-invasively or minimally-invasively 

5 from a patient an assay to identify at least one of said markers in said sample, said 

6 assay having a sensitivity of at least 60% and a specificity of at least 90%; 

7 thereby to detect in said sample a nucleic acid marker indicative of cancer 

8 or precancer. 

1 2. A method for detecting a nucleic acid marker indicative of cancer or precancer in a 

2 tissue or body fluid sample, the method comprising the steps of 

3 (a) performing a multiple mutation assay on a sample obtained non-invasively 

4 or minimally-invasively from a patient; 

5 (b) performing a nucleic acid amplification assay on said sample to detect an 

6 amount of amplifiable nucleic acid in said sample; and 

7 (c) identifying as a positive screen a sample selected from the group 

8 consisting of 

9 a sample in which at least one mutation is identified in step (a); and 

10 * a sample in which an amount of amplified nucleic acid obtained in step (b) 



1 1 exceeds a predetermined threshold. 

1 3. The method of claim 2, wherein said predetermined threshold is an amount of 

2 amplifiable nucleic acid obtainable in a sample from a patient who does not have 

3 cancer or precancer. 

1 4. The method of claim 3, wherein said amount is determined by comparing gel 

2 electrophoresis patterns of nucleic acids obtained by: 

3 (a) amplifying nucleic acid in a sample from said patient, and 

4 (b) amplifying nucleic acid in a sample from a patient known not to have cancer 

5 or precancer. 

1 5. The method of claim 4, wherein said amplifying steps are conducted 

2 contemporaneously. 
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6. The method of claim 1 or 2, wherein said precancer is an adenoma. 

7. The method of claim 1 or 2, wherein said sample is selected from the group 



2 consisting of stool, and a homogenate of stool. 



1 8, The method of claim 1 or 2, wherein said sample is selected from the group 

2 consisting of pus, sputum, urine, blood, cerebrospinal fluid, semen, and aspirate. 

1 9. The method of claim 1 or 2, wherein said multiple mutation assay comprises a single 

2 base extension assay. 

1 10. The method of claim 9, wherein said single base extension assay comprises the 

2 steps of: 

3 (a) annealing an oligonucleotide primer to a nucleic acid sample under conditions 
that promote exact complementary hybridization between said primer and a 

5 portion of a nucleic acid in said sample; 

6 (b) extending said primer by a single base; 

7 (c) separating said extended primer from said portion; and 

8 (d) identifying the base incorporated into said extended primer, thereby to identify 

9 said single base. 

1 11. The method of claim 9, wherein said single base extension assay comprises the 

2 steps of: 

3 (a) annealing an oligonucleotide primer to a nucleic acid sample under conditions 

4 that promote exact complementary hybridization between said primer and a 

5 portion of a nucleic acid in said sample; 

6 (b) exposing said sample to two different deoxynucleotides under conditions to 

7 promote primer extension 

8 (c) extending said primer until said primer can no longer be extended; 

9 (d) separating said extended primer from said portion; and 

10 (e) identifying the base incorporated into said extended primer, thereby to identify 

1 1 said single base. 

12. The method of claim 11, wherein at least one of said deoxynucleotides is 

2 detectably lab led. 

1 13. The method of claim 9, wherein said single base extension assay comprises 

2 identifying single nucleotides in each of a plurality of genomic loci. 



WO 00/58514 



PCT/US00/07882 



-24- 

14. The method of claim 13, wherein said genomic loci are selected from the group 
consisting of ape, Kras, p53, and bat-26. 

15. The method of claim 2, wherein said quantitative nucleic acid amplification assay 
provides a first quantity of amplifiable nucleic acid in said sample, and wherein said first 
quantity is compared to a second, reference quantity in order to determine whether said 
first quantity is indicative of cancer or precancer in said sample. 

16. A method for detecting a marker indicative of cancer or precancer in a tissue or 
body fluid sample, the method comprising: 

conducting a plurality of nucleic acid assays such that said plurality provides a 
sensitivity of detection of at least 60% and a specificity of detection of at least 90% for 
detection of a marker of cancer or precancer in a sample obtained non-invasively or 
minimally-invasively from a patient 

17. The method of claim 16, wherein said plurality comprises from 2 to 5 assays. 

18. The method of claim 16, wherein said plurality comprises a quantitative polymerase 
chain reaction assay and an assay for a mutation in bat-26. 

19. The method of claim 18, wherein said assay for a mutation in bat-26 comprises a 
primer extension assay to identify fragments of said bat-26. 

20 The method of claim 16, wherein said precancer is an adenoma. 

21. The method of claims 1, 2, or 16, wherein said cancer is colorectal cancer. 

22. A method for detecting in a heterogeneous sample the presence of nucleic acid 
a marker of cancer or precancer comprising the steps of 

(a) obtaining a tissue or body fluid sample non-invasively or minimally-invasively 
from a patient; 

(b) performing an assay on said sample to detect at least one of said marker 
with a sensitivity for detection of at least 50% and with a specificity for detection of at 
least 90%; and 

(c) identifying a sample as positive if at least one of said marker is detected in 
said performing step. 
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23. A method for detecting in a heterogeneous sample the presence of nucleic acid 



2 marker of cancer or precancer with a sensitivity of detecting said marker of at least 50% 



and a specificity of detecting said marker of at least 90%, the method comprising 

4 (a) conducting at least two assays for the detection of said marker in said 

5 heterogeneous sample obtained non-invasively or minimally-invasively from a patient, 

6 and 

7 (b) identifying a positive sample as one in which at least one of said indicia is 

8 detected in at least one of said assays. 



24. The method of claim 23, wherein said assays are selected from the group 
consisting of multiple mutation detection, quantitative polymerase chain reaction, 
sequence-specific hybrid capture, oligo-ligation, amplification refractory mutation 

4 system, single-stranded conformational polymorphism detection, sequencing, mismatch 

5 detection, and single base extension. 

1 25. The method of claims 1 , 2, 16, 22, or 23, wherein said sensitivity and said 

2 specificity are determined by comparing results obtained in said screening method with 

3 results obtained using an invasive diagnostic method. 

l 26. The method of claim 25, wherein said invasive diagnostic method is a colonoscopy. 
27. The method of claim 26, wherein said invasive diagnostic method is conducted 



2 contemporaneously with said screening method. 



28. The method of claim 27, wherein said invasive diagnostic method is conducted 
within from about 3 hours to about 3 months of the time at which said screening method 



3 is conducted. 



29. The method of claim 22, wherein said assay is an assay to detect loss of 



2 heterozygosity in at least a portion of a chromosomal arm. 

1 30. The method of claim 29, wherein said assay to detect loss of heterozygosity is 

2 selected from the group consisting of enumerative LOH and single base extension. 



1 31. The method of claim 29, wherein said chromosomal arm is selected from the group 

2 consisting of chromosome 17p ( chromosome 5p, chromosome 8p, chromosome 1q, 

3 and chromosome 18q. 



WO 00/58514 



PCT/US00/07882 



-26- 

32. A method for detecting nucleic acid marker indicative of cancer or precancer in a 
tissue or body fluid sample, the method comprising the steps of: 

amplifying nucleic acid in a sample obtained non-invasively or minimaliy- 
invasively from a patient in order to obtain an amplifiable amount of nucleic acid; and 

identifying a sample containing a marker indicative of cancer or precancer based 
upon said amount, wherein the sensitivity of detection is least 60% and the specificity of 
detection is at least 90%; 

33. The method of claim 32, wherein said amount is compared to standard amounts 
known to exist in samples comprising cancer or precancer. 
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SEQUENCE LISTING 

<110> Lapidus, Stanley N 
Shuber, Anthony P 

<120> Methods for improving sensitivity and specificity of 
screening assays 

<130> EXT-030PC 

<140> 
<141> 

<150> US 09/277,016 
<151> 1999-03-26 

<160> 37 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 37 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . (37) 
<223> K-ras probe 

<400> 1 

gtggagtatt tgatagtgta ttaaccttat gtgtgac 37 

<210> 2 

<211> 37 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> (1) . . (37) 

<223> ape probe ape- 1309 

<400> 2 

ttccagcagt gtcacagcac cctagaacca aatccag 37 

<210> 3 

<211> 37 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (1)..(37) 

<223> ape probe apc-1378 

<400> 3 

cagatagccc tggacaaaca atgccacgaa gcagaag 37 

<210> 4 
<211> 37 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1) . . (37) 
<223> p53 probe 



<400> 4 

tactcccctg ccctcaacaa gatgttttgc caactgg 37 

<210> 5 

<211> 35 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_feature 
<222> (1) . . (35) 
<223> p53 probe 



<400> 5 

atttcttcca tactactacc catcgacctc tcatc 

<210> 6 

<211> 37 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> (l) . . (37) 
<223> p53 probe 



<400> 6 

atgaggccag tgcgccttgg ggagacctgt ggcaagc 37 

<210> 7 

<211> 37 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_feature 
<222> (1) . . (37) 
<223> p53 probe 

<400> 7 

gaaaggacaa gggtggttgg gagtagatgg agcctgg 37 

<210> 8 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR-A FOR 
(K-Ras) 



<400> 8 
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gcggtcccaa aagggtcagt cctgctgaaa atgactgaa 

<210> 9 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR-A-REV 
(K-Ras) 

<400> 9 

gcggtcccaa aagggtcagt catgaaaatg gtcagagaaa 

<210> 10 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR-B- FOR 
(APC-1309) 

<400> 10 

gcggtcgctt ttgggtcagt tgtagttcat tatcatcttt 

<210> 11 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR-B -REV 
(APC-1309) 

<400> 11 

gcggtcgctt ttgggtcagt cttcgctcac aggatcttca 

<210> 12 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR-C-FOR 
(APC-1378) 

<400> 12 

gcggtcgcaa aagggacagt aggcacaaag ctgttgaat 

<210> 13 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

•<223> Description of Artificial Sequence :PCR-C-REV 
(APC-1378) 

<400> 13 
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gcggtcgcaa aagggacagt tatcaagtga actgacagaa 



40 



<210> 14 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR-D- FOR 
(APC-1450) 

<400> 14 

gcggtcccaa aagggtcagt cacctccacc acctcctcaa 40 

<210> 15 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PCR-D-REV 
(APC-1450) 



<210> 16 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PGR- E- FOR (p53 
Exon 5) 

<400> 16 

gcggtcccaa aagggtcagt ccatctacaa gcagtca 37 

<210> 17 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR-E-REV (p53 
Exon 5) 



<210> 18 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PCR-F- FOR (p53 



<400> 15 

gcggtcccaa aagggtcagt gtatcagcat ctggaagaa 



39 



<400> 17 

gcggtcccaa aagggtcagt cagacctaag agcaatca 



38 



Exon 7) 



<400> 18 
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gcggtcccaa aagggtcaga taccaccatc cactacaa 

<210> 19 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR- F-REV (p53 
Exon 7) 

<400> 19 

gcggtcccaa aagggtcaga gtatggaaga aatcggtaa 

<210> 20 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR- G- FOR (p53 
Exon 8) 

<400> 20 

gcggtccctt ttgggtcact ctgcctcttg cttctctttt 

<210> 21 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR-G-REV <p53 
Exon 8) 

<400> 21 

gcggtccctt ttgggtcact cttgtcctgc ttgcttacct 

<210> 22 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_feature 
<222> (1)..(20) 
<223> SBE-A1 

<400> 22 

aacttgtggt agttggagct 

<210> 23 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<220> 

<22i> misc_feature 
<222> (1) . . (20) 
<223> SBE-A2 
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<400> 23 

acttgtggta gttggagctg 

<210> 24 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . (20) 
<223> SBE-A3 

<400> 24 

tgtggtagtt ggagctggtg 

<210> 25 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . (17) 
<223> SBE-B1 

<400> 25 

aaatagcaga aataaaa 

<210> 26 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_feature 
<222> (1) . . (20) 
<223> SBE-C1 

<400> 26 

ctccctccaa aagtggtgct 

<210> 27 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1)..(20) 
<223> SBE-C2 

<400> 27 

gtccacctgt acactatgtt 

<210> 28 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc__feature 
<222> (1) . . (20) 
<223> SBE-D1 

<400> 28 

ctcaaacagc acaaaccaag 

<210> 29 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . (20) 
<223> SBE-E1 



<400> 


29 


catgacggag gttgtgaggc 


<210> 


30 


<211> 


20 


<212> 


DNA 


<213> 


Homo sapiens 


<220> 




<221> 


misc feature 


<222> 


(1) . . (20) 


<223> 


SBE-F1 


<400> 


30 


gtaacagttc ctgcatgggc 


<210> 


31 


<211> 


20 


<212> 


DNA 


<213> 


Homo sapiens 


<220> 




<221> 


misc feature 


<222> 


(1) . . (20) 


<223> 


SBE-F2 


<400> 


31 


taacagttcc tgcatgggcg 


<210> 


32 


<211> 


20 


<212> 


DNA 


<213> 


Homo sapiens 


<220> 




<221> 


misc feature 


<222> 


(1) . . (20) 


<223> 


SBE-F3 



20 



20 



20 



<400> 32 

cctgcatggg cggcatgaac 



20 
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<210> 33 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> tnisc_f eature 

<222> (1) . . (20) 

<223> SBE-F4 

<400> 33 

ctgcatgggc ggcatgaacc 

<210> 34 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . (20) 
<223> SBE-G1 

<400> 34 

gacggaacag ctttgaggtg 

<210> 35 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_f eature 
<222> (1) .7(20) 
<223> SBE-G2 

<400> 35 

acggaacagc tttgaggtgc 

<210> 36 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_feature 
<222> (1) . . (20) 
<223> SBE-G3 

<400> 36 

gtgcctatcc tgggagagac 

<210> 37 

<211> 314 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 
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<222> (1) . . (314) 

<223> BAT- 26, wherein each M n" corresponds to a 
nucleotide of unknown identity 



<400> 37 

ccagtggtat agaaatcttc gatttttaaa 
gtctgcggta atcaagtttt tagaactctt 
actgactact tttgacttca gccagtatat 
taaccttttt caggtaaaaa aaaaaaaaaa 
gttaannnnn nnngacagat agtgaagaag 
caatattaga caag 



ttcttaattt taggttgcag tttcatcact 60 
atcagatgat tccaactttg gacagtttga 120 
gaaattggat attgcagcag tcagagccct 180 
aaaaaaaaaa agggttaaaa atgttgattg 24 0 
gcttagaaag gagctaaaag agttcgacat 300 

314 



